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Abstract — We study sensor networks with energy harvesting 
nodes. The generated energy at a node can be stored in a 
buffer. A sensor node periodically senses a random field and 
generates a packet. These packets are stored in a queue and 
transmitted using the energy available at that time at the node. 
For such networks we develop efficient energy management 
policies. First, for a single node, we obtain policies that are 
throughput optimal, i.e., the data queue stays stable for the 
largest possible data rate. Next we obtain energy management 
policies which minimize the mean delay in the queue. We 
also compare performance of several easily implementable sub- 
optimal policies. A greedy policy is identified which, in low SNR 
regime, is throughput optimal and also minimizes mean delay. 
Next using the results for a single node, we develop efficient 
MAC policies. 

Keywords: Optimal energy management policies, energy 
harvesting, sensor networks, MAC protocols. 

I. Introduction 

Sensor networks consist of a large number of small, inex- 
pensive sensor nodes. These nodes have small batteries with 
limited power and also have limited computational power 
and storage space. When the battery of a node is exhausted, 
it is not replaced and the node dies. When sufficient number 
of nodes die, the network may not be able to perform 
its designated task. Thus the life time of a network is an 
important characteristic of a sensor network and it is tied up 
with the life time of a node. 

Various studies have been conducted to increase the life 
time of the battery of a node by reducing the energy intensive 
tasks, e.g., reducing the number of bits to transmit ([3], [15]), 
making a node to go into power saving modes: (sleep/listen) 
periodically ([21]), using energy efficient routing ([18], [24]) 
and MAC ([25]). Studies that estimate the life time of a 
sensor network include [18]. A general survey on sensor 
networks is [1] which provides many more references on 
these issues. 

In this paper we focus on increasing the life time of the 
battery itself by energy harvesting techniques ([9], [14]). 
Common energy harvesting devices are solar cells, wind tur- 
bines and piezo-electric cells, which extract energy from the 
environment. Among these, solar harvesting energy through 
photo-voltaic effect seems to have emerged as a technology 
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of choice for many sensor nodes ([14], [16]). Unlike for 
a battery operated sensor node, now there is potentially an 
infinite amount of energy available to the node. Hence energy 
conservation need not be the dominant theme. Rather, the 
issues involved in a node with an energy harvesting source 
can be quite different. The source of energy and the energy 
harvesting device may be such that the energy cannot be 
generated at all times (e.g., a solar cell). However one may 
want to use the sensor nodes at such times also. Furthermore 
the rate of generation of energy can be limited. Thus one 
may want to match the energy generation profile of the 
harvesting source with the energy consumption profile of 
the sensor node. It should be done in such a way that the 
node can perform satisfactorily for a long time, i.e., at least 
energy starvation should not be the reason for the node to 
die. Furthermore, in a sensor network, the MAC protocol, 
routing and relaying of data through the network may need to 
be suitably modified to match the energy generation profiles 
of different nodes, which may vary with the nodes. 

In the following we survey the literature on sensor net- 
works with energy harvesting nodes. Early papers on energy 
harvesting in sensor networks are [10] and [17]. A practical 
solar energy harvesting sensor node prototype is described 
in [8]. A good recent contribution is [9]. It provides various 
deterministic theoretical models for energy generation and 
energy consumption profiles (based on (<r, p) traffic models 
and provides conditions for energy neutral operation, ie., 
when the node can operate indefinitely. In [7] a sensor node 
is considered which is sensing certain interesting events. The 
authors study optimal sleep-wake cycles such that the event 
detection probability is maximized. A recent survey is [14] 
which also provides an optimal sleep-wake cycle for solar 
cells so as to obtain QoS for a sensor node. 

MAC protocols for sensor networks are studied in [23], 
[25] and [26]. A general survey is available in [1] and 
[12], Throughput optimal opportunistic MAC protocols are 
discussed in [13]. 

In this paper we summarize our recent results ([20]), on 
sensor networks with energy harvesting nodes and based 
on them propose new schemes for scheduling a MAC for 
such networks. The motivating application is estimation of 
a random field which is one of the canonical applications 
of sensor networks. The above mentioned theoretical studies 
are motivated by other applications of sensor networks. In 
our application, the sensor nodes sense the random field 
periodically. After sensing, a node generates a packet (pos- 
sibly after efficient compression). This packet needs to be 
transmitted to a central node, possibly via other sensor nodes. 



In an energy harvesting node, sometimes there may not 
be sufficient energy to transmit the generated packets (or 
even sense) at regular intervals and then the node may need 
to store the packets till they are transmitted. The energy 
generated can be stored (possibly in a finite storage) for later 
use. 

Initially we will assume that most of the energy is con- 
sumed in transmission only. We will relax this assumption 
later on. We find conditions for energy neutral operation of 
a node, i.e., when the node can work forever and its data 
queue is stable. We will obtain policies which can support 
maximum possible data rate. 

We also obtain energy management (power control) poli- 
cies for transmission which minimize the mean delay of the 
packets in the queue. 

We use the above energy mangement policies to develop 
channel sharing policies at a MAC (Multiple Access Chan- 
nel) used by energy harvesting sensor nodes. 

We are currently investigating appropriate routing algo- 
rithms for a network of energy harvesting sensor nodes. 

The paper is organized as follows. SectionlHldescribes the 
model for a single node and provides the assumptions made 
for data and energy generation. Section [Til] provides condi- 
tions for energy neutral operation of a node. We obtain stable, 
power control policies which are throughput optimal. Section 
IfVl obtains the power control policies which minimize the 
mean delay via Markov decision theory. A greedy policy 
is shown to be throughput optimal and provides minimum 
mean delays for linear transmission. Section [V] provides a 
throughput optimal policy when the energy consumed in 
sensing and processing is non-negligible. A sensor node with 
a fading channel is also considered. Section [VI] provides 
simulation results to confirm our theoretical findings and 
compares various energy management policies. Section IVHI 
introduces a multiple access channel (MAC) for energy 
harvesting nodes. Section rVIIII provides efficient energy man- 
agement schemes for orthogonal MAC protocols. Sections 
iLXl and [X] consider MACs with fading channels (orthogonal 
and CSMA respectively). Section |XI] compares the different 
MAC policies via simulations. Section IXHI concludes the 
paper. 

II. Model for a single node 

In this section we present our model for a single energy 
harvesting sensor node. 

We consider a sensor node (Fig. Q]) which is sensing a 
random field and generating packets to be transmitted to a 
central node via a network of sensor nodes. The system is 
slotted. During slot k (defined as time interval [k, k + 1], 
i.e., a slot is a unit of time) X k bits are generated by 
the sensor node. Although the sensor node may generate 
data as packets, we will allow arbitrary fragmentation of 
packets during transmission. Thus, packet boundaries are not 
important and we consider bit strings (or just fluid). The bits 
X k are eligible for transmission in (k+ l)st slot. The queue 
length (in bits) at time k is q k . The sensor node is able to 
transmit g(T k ) bits in slot k if it uses energy T k . We assume 
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Fig. 1. The model 



that transmission consumes most of the energy in a sensor 
node and ignore other causes of energy consumption (this 
is true for many low quality, low rate sensor nodes ([16])). 
This assumption will be removed in Section [V] We denote 
by E k the energy available in the node at time k. The sensor 
node is able to replenish energy by Y k in slot k. 

We will initially assume that {Xk} and {Y k } are iid 
(independent, identically distributed) but will generalize this 
assumption later. It is important to generalize this assumption 
to capture realistic traffic streams and energy generation 
profiles. 

The processes {q k } and {E k } satisfy 



Qk+i 
Ek+i 



(qk - 



g(T k )) + +X k , 
-T k ) + Y k . 



(1) 

(2) 



where T k < E k . This assumes that the data buffer and the 
energy storage buffer are infinite. If in practice these buffers 
are large enough, this is a good approximation. If not, even 
then the results obtained provide important insights and the 
policies obtained often perform well for the finite buffer case. 

The function g will be assumed to be monotonically 
non-decreasing. An important such function is given by 
Shannon's capacity formula 



g(T k ) 



■0Tk) 



(3) 



for Gaussian channels where (3 is a constant such that (3 T k 
is the SNR. This is a non-decreasing concave function. At 
low values of T k , g(T k ) ~ j3\ T k , i.e., g becomes a linear 
function. Since sensor nodes are energy constrained, this is 
a practically important case. Thus in the following we limit 
our attention to linear and concave nondecreasing functions 
g. We will also assume that g(0) = which always holds in 
practice. 

Many of our results (especially the stability results) will be 
valid when {X k } and {Y k } are stationary, ergodic. These as- 
sumptions are general enough to cover most of the stochastic 
models developed for traffic (e.g., Markov modulated) and 
energy harvesting. 

Of course, in practice, statistics of the traffic and energy 
harvesting models will be time varying (e.g., solar cell energy 
harvesting will depend on the time of day). But often they 
can be approximated by piecewise stationary processes. For 



example, energy harvesting by solar cells could be taken 
as being stationary over one hour periods. Then our results 
could be used over these time periods. Often these periods 
are long enough for the system to attain (approximate) 
stationarity and for our results to remain meaningful. 

In Section [Ell] we study the stability of this queue and 
identify easily implementable energy management policies 
which provide good performance. 

III. Stability 

We will obtain a necessary condition for stability. Then we 
present a transmission policy which achieves the necessary 
condition, i.e., the policy is throughput optimal. The mean 
delay for this policy is not minimal. Thus, we obtain other 
policies which provide lower mean delay. In the next section 
we will consider policies which minimize mean delay. 

The proofs of Lemma 1 and Theorems 1-4 are provided 
in [20]. 

Let us assume that we have obtained an (asymptoti- 
cally) stationary and ergodic transmission policy {T k } which 
makes {q k } (asymptotically) stationary with the limiting 
distribution independent of qo. Taking {T k } asymptotically 
stationary seems to be a natural requirement to obtain 
(asymptotic) stationarity of {q k }- 

In the following X, T, Y will denote generic r.v.s with the 
distributions of X\ , T\, Y\ respectively. 

Lemma 1 Let g be concave nondeceasing and {X k }, {Yk} 
be stationary, ergodic sequences. For {T k } to be an asymp- 
totically stationary, ergodic energy management policy that 
makes {qk} asymptotically stationary with a proper station- 
ary distribution w, it is necessary that E[X] < E^[g(T)] < 

g{E[Y]). ■ 

Let 

T k = min(E k , E[Y] - e) (4) 

where e is an appropriately chosen small constant (see 
statement of Theorem 1). We show that it is a throughput 
optimal policy, i.e., using this T k with g satisfying the 
assumptions in Lemma 1, {q k } is asymptotically stationary 
and ergodic. 

Theorem 1 If {Xfc},{Yfc} are stationary, ergodic 
and g is continuous, nondecreasing, concave then 
if E[X] < g(E[Y]), © makes the queue stable 
(with e > such that E[X] < g(E[Y] - e)), i.e., 
it has a unique, stationary, ergodic distribution and 
starting from any initial distribution, q k converges 
in total variation to the stationary distribution. 

I 

Henceforth we denote the policy by TO. 

From results on GI/GI/1 queues ([2]), if {Xk} are iid, 
E[X] < g(E[Y]),T k = min{E kl E[Y]-e) and E[X a ] < oo 
for some a > 1 then the stationary solution {q k } of (HJ 
satisfies < oo. 

Taking T k = Y k for all k will provide stability of the queue 
if E[X] < E[g(Y)]. If g is linear then this coincides with the 
necessary condition. If g is strictly concave then E[g(Y)] < 
g(E[Y]) unless Y = E[Y]. Thus T k = Y k provides a strictly 



smaller stability region. We will be forced to use this policy 
if there is no buffer to store the energy harvested. This shows 
that storing energy allows us to have a larger stability region. 
We will see in Section[Vl]that storing energy can also provide 
lower mean delays. 

Although TO is a throughput optimal policy, if q k is 
small, we may be wasting some energy. Thus, it appears 
that this policy does not minimize mean delay. It is useful 
to look for policies which minimize mean delay. Based on 
our experience in [19], the Greedy policy 

T fe = min(E k , f(q k )) (5) 

where / = g^ 1 , looks promising. In Theorem 2, we will 
show that the stability condition for this policy is E[X] < 
E[g(Y)] which is optimal for linear g but strictly suboptimal 
for a strictly concave g (just as the policy T k = Y k discussed 
above). We will show in Section [IV] that when g is linear, 
(O is not only throughput optimal, it also minimizes long 
term mean delay. 

For concave g, we will show via simulations that (O 
provides less mean delay than TO at low load. However 
since its stability region is smaller than that of the TO policy, 
at E[X] close to E[g(Y)}, the Greedy performance rapidly 
deteriorates. Thus it is worthwhile to look for some other 
good policy. Notice that the TO policy wastes energy if 
q k < g(E[Y] — e). Therfore, we can improve upon it by 
saving the energy — e — g^ 1 (q k )) and using it when 

the q k is greater than g(E[Y] — e). However for g a log 
function, using a large amount of energy t is also wasteful 
even when q k > g(t). Taking into account these facts we 
improve over the TO policy as 

T k = min(g-\q k ),E k ,0.99(E[Y] + 0.001(£ fc - cq k )+)) 

(6) 

where c is a positive constant. The improvement over the TO 
also comes from the fact that if E k is large, we allow T k > 
E[Y] but only if q k is not very large. The constants 0.99 and 
0.001 were chosen by trial and error from simulations after 
experimenting with different scenarios. 

We will see in Section [Vl] via simulations that this policy, 
to be denoted by MTO can indeed provide lower mean delays 
than TO at loads above E[g(Y)}. 

One advantage of over (© and © is that while using 
dU, after some time T k = E[Y] — e. Also, at any time, 
either one uses up all the energy or uses E[Y] — e. Thus one 
can use this policy even if exact information about E k is 
not available (measuring E k may be difficult in practice). In 
fact, (0]i does not need even q k while (01 either uses up all 
the energy or uses f(q k ) and hence needs only q k exactly. 

Now we show that under the greedy policy (© the 
queueing process is stable when E\X\ < E[g(Y)]. In next 
few results (Theorems 2 and 3) we assume that the energy 
buffer is finite, although large. For this case Lemma 1 and 
Theorem 1 also hold under the same assumptions with slight 
modifications in their proofs. 

Theorem 2 If the energy buffer is finite, i.e., E k < e < oo 



(but e is large enough) and E[X] < E[g(Y)] then under the 
greedy policy (0, (qk, Ek) has an Ergodic set. ■ 
The above result will ensure that the Markov chain 
{(qk, Ek)} is ergodic and hence has a unique stationary dis- 
tribution if {(qk,Ek)} is irreducible. A sufficient condition 
for this is < P[X k = 0] < 1 and < P[Y k = 0] < 1 
because then the state (0, 0) can be reached from any state 
with a positive probability. In general, {(qk,Ek)} can have 
multiple ergodic sets. Then, depending on the initial state, 
{(qk,Ek)} will converge to one of the ergodic sets and the 
limiting distribution depends on the initial conditions. 

IV. Optimal Policies 

In this section we choose Tk at time k as a function of qk 
and Ek such that 



E 



U=0 



a k q k 



is minimized where < a < 1 is a suitable constant. The 
minimizing policy is called a-discount optimal. When a = 1, 
we minimize 



lim sup— E 



n-1 
,fc=0 



This optimizing policy is called average cost optimal. By 
Little's law ([2]) an average cost optimal policy also mini- 
mizes mean delay. If for a given (qk, Ek), the optimal policy 
Tk does not depend on the past values, and is time invariant, 
it is called a stationary Markov policy. 

If {Xk} and {Yfe} are Markov chains then these optimiza- 
tion problems are Markov Decision Problems (MDP). For 
simplicity, in the following we consider these problems when 
{Xk} and {Yk} are iid. We obtain the existence of optimal 
a-discount and average cost stationary Markov policies. 

Theorem 3 If g is continuous and the energy buffer is 
finite, i.e., < e < oo then there exists an optimal a- 
discounted Markov stationary policy. If in addition E[X] < 
g(E[Y]) and E[X 2 ] < oo, then there exists an average cost 
optimal stationary Markov policy. The optimal cost v does 
not depend on the initial state. Also, then the optimal a- 
discount policies tend to an optimal average cost policy as 
a — ► 1. Furthermore, if v a (q,e) is the optimal a-discount 
cost for the initial state (q, e) then 

lim(l - a) mSr g e )V a (q, e) = V 

a— >1 

■ 

In Section [HI] we identified a throughput optimal policy 
when g is nondecreasing, concave. Theorem 3 guarantees the 
existence of an optimal mean delay policy. It is of interest 
to identify one such policy also. In general one can compute 
an optimal policy numerically via Value Iteration or Policy 
Iteration but that can be computationally intensive (especially 
for large data and energy buffer sizes). Also it does not 
provide any insight and requires traffic and energy profile 
statistics. In Section [HI] we also provided a greedy policy 
(O which is very intuitive, and is throughput optimal for 
linear g. However for concave g (including the cost function 



^log(l + jt)) it is not throughput optimal and provides low 
mean delays only for low load. Next we show that it provides 
minimum mean delay for linear g. 

Theorem 4 The Greedy policy (0 is a-discount optimal 
for < a < 1 when g(t) = jt for some 7 > 0. It is also av- 
erage cost optimal. ■ 

The fact that Greedy is a-discount optimal as well as 
average cost optimal implies that it is good not only for 
long term average delay but also for transient mean delays. 

V. Generalizations 

In this section we consider two generalizations. First we 
will extend the results to the case of fading channels and 
then to the case where the sensing and the processing energy 
at a sensor node are non-negligible with respect to the 
transmission energy. 

In case of fading channels, we assume flat fading during a 
slot. In slot k the channel gain is hk. The sequence {hk} 
is assumed stationary, ergodic, independent of the traffic 
sequence {Xk} and the energy generation sequence {Yk}. 
Then if Tk energy is spent in transmission in slot k, the 
{qk} process evolves as 



qk+i 



(q k - g(h k T k )) + + X k 



If the channel state information (CSI) is not known to the 
sensor node, then Tk will depend only on (qk, Ek). One can 
then consider the policies used above. For example we could 
use Tk — min(Ek, E[Y] — e). Then the data queue is stable if 
E[X] < E[g(h(E[Y]-e))}. We will call this policy unfaded 
TO. If we use Greedy (0, then the data queue is stable if 
E[X] < E[g(hY)]. 

If CSI hk is available to the node at time k, then the 
following are the throughput optimal policies. If g is linear, 
then g(x) = (3x for some j3 > 0. Then, if < h < 
h < 00 and P(h = h) > 0, the optimal policy is: 
T(h) = (E[Y\ - e)/p{h = h) and T(h) = otherwise. 
Thus if h can take an arbitrarily large value with positive 
probability, then E[hT(h)] = 00 at the optimal solution. 

If g(x) = hlog(l+f3x), then the water filling (WF) policy 



T k (h) 



1 

h 



(7) 



where ho is obtained from the average power constraint 
E[Tk] = E[Y] — e, is throughput optimal. This is because it 
maximizes hEh[log(l+ (3hT(h))] with the given constraints. 

Both of the above policies can be improved as before, by 
not wasting energy when there is not enough data. As in (O 
in Section [TTTJ we can further improve WF by taking 



T k = min f(q k ),E k , 



1 

h 



1 

7^ 



Qim(Ek-cq k 




We will call it MWF These policies will not minimize mean 
delay. For that, we can use the MDP framework used in 
Section |IV] and numerically compute the optimal policies. 

Till now we assumed that all the energy that a node 
consumes is for transmission. However, sensing, processing 



and receiving (from other nodes) also require significant 
energy, especially in more recent higher end sensor nodes 
([16]). Since we have been considering a single node so far, 
we will now include the energy consumed by sensing and 
processing only. For simplicity, we will assume that the node 
is always in one energy mode (e.g., lower energy modes 
([21]) available for sensor nodes will not be considered). 
If a sensor node with an energy harvesting system can be 
operated in energy neutral operation in normal mode itself 
(i.e., it satisfies the conditions in Lemma 1), then there is 
no need to have lower energy modes. Otherwise one has to 
resort to energy saving modes. 

We will assume that Zk is the energy consumed by 
the node for sensing and processing in slot k. Unlike Tfc 
(which can vary according to qk), {Zk} can be considered a 
stationary, ergodic sequence. The rest of the system is as in 
Section [II] Now we briefly describe a energy management 
policy which is an extension of the TO policy in Section 
HIH This can provide an energy neutral operation in the 
present case. Improved/optimal policies can be obtained for 
this system also but will not be discussed due to lack of 
space. 

Let c be the minimum positive constant such that E[X] < 
g(c). Then if c + E[Z] < E[Y] - 5, (where S is a small 
positive constant) the system can be operated in energy 
neutral operation: If we take Tk = c (which can be done 
with high probability for all k large enough), the process 
{qk} will have a unique stationary, ergodic distribution and 
there will always be energy Zk for sensing and processing 
for all k large enough. The result holds if {(Xk, Yk, Zk)} is 
an ergodic stationary sequence. The arguments to show this 
are similar to those in Section [III] and are omitted. 

When the channel has fading, we need E[X] < E[g(ch)] 
in the above paragraph. 

VI. Simulations for single node 

In this section, we compare the different policies we have 
studied via simulations. The g function is taken as linear 
(g(x) — Wx) or as g(x) = log(l + x) . The sequences {Xk} 
and {Yk} are iid. (We have also done limited simulations 
when {Xk} and {Yk} are Autoregressive and found that 
conclusions drawn in this section continue to hold). We 
consider the cases when X and Y have truncated Poisson, 
exponential, Erlang or Hyperexponential distributions. The 
policies considered are: Greedy, TO, Tk = Yk, MTO (with 
c = 0.1) and the mean delay optimal. At the end, we will 
also consider channels with fading. For fading channels we 
compare unfaded TO and MTO against fading TO and fading 
MTO. For the linear g, we already know that the Greedy 
policy is throughput optimal as well as mean delay optimal. 

The mean queue lengths for the different cases are plotted 
in Figs. EE 

In Fig. [2] we compare Greedy, TO and mean-delay optimal 
(OP) policies for nonlinear g. The OP was computed via 
Policy Iteration. For numerical computations, all quantities 
need to be finite. So we took data and energy buffer sizes 
to be 50 and used quantized versions of qk and Ek- The 




Fig. 2. Mean Delay Optimal, Greedy, TO Policies with No Fading; 
Nonlinear g; Finite, Quantized data and energy buffers; X, Y: Poisson 
truncated at 5; E[Y] = l,E[g(Y)] = 0.92, g(E[Y]) = 1 




Fig. 3. Comparison of policies with No Fading; g(x) = lOx; X, Y: 
Exponential; E[Y] = l,E[g(Y)] = 10, g[E[Y]) = 10 
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Fig. 4. Comparison of policies with No Fading; g(x) = log(l + x)\ X, Y: 
Exponential; E[Y] = 10, E[g(Y)} = 2.01, g(E[Y}) = 2.4 



distribution of X and Y is Poisson truncated at 5. These 
changes were made only for this example. Now g(E[Y\) = 1 
and E[g(Y)] = 0.92. We see that the mean queue length of 
the three policies are negligible till E[X] — 0.8. After that, 
the mean queue length of the Greedy policy rapidly increases 
while performances of the other two policies are comparable 
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Fig. 5. Comparison of policies with Fading; g(x) = lOx; X, Y: 
Hyperexponential(5); E[Y] = l,E[g(Y)} = 10, g(E[Y]) = 10 
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Fig. 6. Comparison of policies with Fading; g(x) = log(l + x); X,Y: 
Erlang(5); E[Y] = 1, E[g{hY)} = 0.62, E\g{hE\Y})} = 0.64; WF, Mod. 
WF stable for E[X] < 0.70 



till 1 (although from E[X] — 0.6 till close to 1, mean queue 
length of TO is approximately double of OP). At low loads, 
Greedy has less mean queue length than TO. 

Fig. |3] considers the case when X and Y are exponential 
and g is linear. Now E[Y] = 1 and g{E[Y]) = E[g(Y)] 
10. Now all the policies considered are throughput optimal 
but their delay performances differ. We observe that the 
policy Tfc = Yk (henceforth called unbuffered) has the worst 
performance. Next is the TO. 

Fig. |4] provides the above results for g nonlinear, when X 
and Y are exponential. Now, as before, Tk = Yk is the worst. 
The Greedy performs better than the other policies for low 
values of S[Jf]. But Greedy becomes unstable at E[g{Y)\ = 
2.01 while the throughput optimal policies become unstable 
at g(E[Y]) = 2.40. Now for higher values of E[X], the 
modified TO performs the best and is close to Greedy at 
low E[X]. 

Figs. |5]|6] provide results for fading channels. The fading 
process {hk} is iid taking values 0.1,0.5,1.0 and 2.2 with 
probabilities 0.1,0.3,0.4 and 0.2 respectively. Fig. [5] is for 
linear g and Fig. [6] is for nonlinear g. The policies compared 
are unbuffered, Greedy, Unfaded TO © and Fading TO. In 



Fig. [6] we have also considered Modified Unfaded TO © 
and Modified Fading TO (MWF) with c = 0.1 ©. 

In Fig. [5] X and Y have Hyperexponential 
distributions. The distribution of r.v. X is a 
mixture of 5 exponential distributions with means 
E[X}/4.9, 2E[X]/4.9, 3E[X]/4.9, 6E[X]/4.9 and 
10E[X]/4.9 and probabilities 0.1,0.2,0.2,0.3 and 0.2 
respectively. The distribution of Y is obtained in the 
same way. Now E[Y] = 1, E[g(hY)} = 10 and 
E[g{hE[Y])] =_ 10. The stability region of fading TO 
is E[X] < E[g(hY)] = 22.0 while that of the other three 
policies is E[X] < 10. However, mean queue length of 
fading TO is larger from the beginning till almost 10. This is 
because in fading TO, we transmit only when h = h = 2.2 
which has a small probability (= 0.2). 

Fig.|6]considers nonlinear g with X, Y Erlang distributed. 
Also, E[Y] = l,E[g{KY)] = 0.62, E[g{hE\Y])] = 0.64. 
Now we see that the stability region of unbuffered and 
Greedy is the smallest, then of TO and MTO while WF 
and MWF provide the largest region and are stable for 
E[X] < 0.70. MTO and MWF provide improvements in 
mean queue lengths over TO and WF. 

VII. Multiple Access Channel 

In a sensor network, all the nodes need to transmit their 
data to a fusion node. Thus, for this a natural network to 
consider is a Tree ([3]). In the present scenario of nodes with 
energy harvesting sources, selection of a Tree can depend on 
the energy profiles of different nodes. This will be subject 
of another study. Here we assume that an appropriate Tree 
has been formed and will concentrate on the link layer. 

An important building block for this network is a multiple 
access channel. In sensor networks contention based (e.g., 
CSMA) and contention-free (TDMA/CDMA/FDMA) MAC 
protocols are considered suitable ([1], [12]). In fact for 
estimation of a random field, contention-free protocols are 
more appropriate. 

We consider the case where N nodes with data queues 
Qi,...,Qn are sharing a wireless channel. Each queue 
generates its traffic, stores in a queue and transmits as in 
Section [TTJ Also, each node has its own energy harvesting 
mechanism. The traffic generated at different queues and 
their energy mechanisms are assumed independent of each 
other. 

Let {Xk(i)}, [Yk(i)} and {Zk(i)} be the sequences 
corresponding to node i. For simplicity we will assume 
{X k (i),k > 0} and {Y k {i),k > 0} to be iid although 
these assumptions can be weakened as for a single queue. As 
mentioned at the end of Section [VJ the energy consumption 
{Zk(i)} can be taken care of if we simply replace 
by — in our algorithms. In the following 

we do that and write it as only (and hence ignore 

Z k {i)). 

The N queues can share the channel in different ways. 
The stability region of Qi, Q%, Qjy and optimal (good) 
transmit policies depend upon the sharing mechanism used. 



We consider a few commonly used scenarios in the rest of 
the paper. 

VIII. Orthogonal Channels 

The N sensor nodes use TDMA/orthogonal 
CDMA/FDMA/OFDMA to transmit. Then the N queues 
become independent, decoupled queues and can be 
considered separately. Thus, the transmission policies 
developed in previous sections for a single queue can be 
used here. In the following we explain them in the context 
of TDMA. 

If the queues have to use the channel in a TDMA fashion 
then necessary conditions for stability of the N queues are 
: There exist ai,ot2, ....otjs[,a.i > and $D i=1 a» = 1 such 
that 



E[X(i)] < at gi 



( E[Y(i 



i= 1,2, ...,N, (9) 



where gi is the energy to bit mapping for Qi. A stable policy 
for each queue will be as in Section Hill Qi is given oti 
fraction of slots (on a long term basis) and it uses energy 
(E[Y(i)] — e)/oti whenever it transmits. For better delay 
performance, the slots allocated to different queues should 
be uniformly spaced. We can improve on the mean delay by 
using ©. 

It is possible that more than one set of (pn, a/v) satisfy 
the stability condition (0. Then one should select the values 
which minimize a cost function, (say) weighted sum of mean 
delays. 

IX. Opportunistic Scheduling for fading 

CHANNELS: ORTHOGONAL CHANNELS 

Now we discuss the MAC with fading. Let {hk(i), k > 1} 
be the channel gain process for Qi. It is assumed stationary, 
ergodic and independent of the fading process for Qj,j ^ i. 
We discuss opportunistic scheduling for the contention free 
MAC. We will study the CSMA based algorithms in the next 
Section. 

If we assume that each of Qi has infinite data backlog, 
then the policy that maximizes the sum of throughputs for 
g(x) — log(l + (3x) and for symmetric statistics (i.e., each 
hi has same statistics and all E[Y(i)] are same) is to choose 

Q 

if. = arg max(hk(i)) (10) 

in slot k and use T^Qi) via the water-filling formula (Q with 
the average power constraint 

E[T k (h)]=NE[Y(it)-e}. 

This is an extension of an algorithm in [11] to the energy 
harvesting nodes. A modification of this policy is available 
in [11] for asymmetric case. 

If g is linear, then for the symmetric case, a channel 
is selected only if it has the highest possible gain (for h 
bounded). If more than one channel is in the best state, select 
one of them with equal probability. 

Although ([Tol l maximizes throughput, it may be unfair to 
different queues and may not provide the QoS. Furthermore, 



in our setup infinite backlog is not a realistic assumption. 
Without this assumption a throughput optimal policy (in the 
class of policies which use constant powers) is to choose 
queue 



arg max q k (i)gi h k (i 



E[Y{i)\ - 
a(i) 



(11) 



and then use T k = {E[Y{i* k )\ - e)/a{i* k ). Here a(i%) is 
the fraction of time slots assigned to i k . However now 
we do not know and this may be estimated (see 

the end of this section). If a(i k ) is replaced with the true 
value, then stability of the queues in the MAC follows 
from [5] if the fading states take values in a finite set 
and the system satisfies the following condition. Let there 
exist a function f(r k (l), ...,rk(N)) which picks one of the 
queues as a function of ( ffc(l), rfc(iV)) where ffc(i) = 
.9, (h k {i) E[Y ™- e ), a{i) ± £Ul{/(n,...,rx) = *}] and 
7r(ri, rjv) is the stationary distribution of (n, rjy). 
Then if E[X(i)] < T,nl{f{ri,...,r N ) = i}*(ri, ...,r N ) 
for each i, the system is stable. This policy tries to satisfy 
the traffic requirements of different nodes but may not be 
delay optimal. Based on experience in [19], a Greedy policy 



i* k = arg max min gi h k (i) 



E[Y(i)\ 
a(i) 



,Qk(i) 



(12) 

provides better mean delays. However, it is throughput opti- 
mal only for symmetric traffic statistics and when EY(i)] — 
E[Y(j)], for all But it can be made throughput optimal 
(as (fTTT i) while still retaining (partially) its mean delay per- 
formance as follows. Choose an appropriately large positive 
constant L. If none of q k {i) is greater than L, use ( fT2l ; 
otherwise, on the set {i : q k (i) > L,}, use (fTTT i. We call 
this Modified Greedy Policy. 

The mean delay of the above policies can be further 
improved, if instead of T k = (E[Y(i)} — e)/a(i), we use 
©. But the stability region remains same. 

The policies (fTTT i and (fT2l can be further improved if 
instead of using T k (i) — E[Y(i)] — e, we use waterfilling 
for g in ©. Of course we reduce transmit power as in © if 
there is not enough data to transmit. Now not only the mean 
delays reduce but the stability region also enlarges. 

The policies (fTTT i. (flZb and Modified Greedy provide good 
performance, require minimal information (only E[Y(i)]), 
are easy to implement and have low computational require- 
ments. In addition they naturally adapt to changing traffic 
and channel conditions. 

In (fTTT l- (fT2l) we need a(i) to obtain the energy T k . But 
unlike for TDMA, a(i k ) is not available in these algorithms 
and depends on the algorithm used. Thus, in these algorithms 
we use a simple variant of the LMS (Least Mean Square) 
algorithm ([6]) to estimate a(i): 

Initially start with guess 



ao(i) = jt, i = 1, -,N. 



Run the algorithm for (say) L\ number of slots. Each node 
% computes the fraction a (i) of slots it gets and recomputes 



a n +i{i) = a n (i) - (J,(a„(i) - a («)) 



(13) 



where /i is a small postive constant. 

At any time the current estimate of a(i) is used by the 
algorithms. 

X. Opportunistic scheduling for fading 
channels: CSMA 

Since ZigBee and 802.11 use CSMA, we discuss oppor- 
tunistic scheduling for CSMA also. As against ([TTTi-([T2"i>. this 
is a completely decentralized algorithm. This is used by Zhao 
and Tong [26]. The basic idea in [26] is to make the back-off 
mechanism in a node to be a function of the channel state of 
that node. The nodes that are to be given priority are given 
smaller back-off time. This mechanism has also been used in 
IEEE 802.1 le to provide priority to voice and video traffic. 
In this section we take this idea further by also including the 
effect of queue lengths and power control in deciding the 
back-off interval as against only the channel state in [26]. 

Let / be a nonincreasing function with values in [0, r max ] 
where T max is the maximum allowed back-off time in slots. 
If h is the channel gain in a slot then in [26] the back-off 
time is taken to be f(h). 

In our setup, to use opportunistic scheduling in CSMA, 
we use the above mentioned monotonic function / on each 
of the sensor nodes contending for the channel. The Q, uses 
the back off time of 



/ \9i\ h k {i) 



(E[Y(i 



a(i) 



(14) 



When a node gets the channel, it will transmit a complete 
packet and use energy per slot as 



T k (i%) = [E[Y(i%)] - e]/a(il) 



(15) 



Now we are making the usual assumption that the channel 
gains stay constant during the transmission of a packet. We 
can use (|6]l to improve performance. 

Using the ideas in the last section we can develop better 
algorithms than (fl4l-(fT5]>. Indeed, with (fl4l , instead of ( TBI , 
we can use waterfilling (for g in (01). We can also improve 
over (fl4l by using, for back-off time of ith node, 



/ Qk(i) 9i h k (i) 



a{i) 



(16) 



which takes care of the traffic requirements of different 
nodes. 

We can also use the (modified) Greedy in 112) . The a(i) 
in the above algorithms will be computed via LMS in ( fT3l . 

We will compare the performance of these algorithms via 
simulations in Section IXj 

An advantage of above algorithms over the algorithms 
in Section [IX] are that these are completely decentralized: 
Each node uses only its own queue length, channel state 
and to decide when to transmit. The algorithms 



in Section [IX] require a central controller (may be a clus- 
ter head) for implementation. Centralized algorithms have 
also been considered in sensor networks and provide better 
performance. 

XI. Simulations for MAC Protocols 

In this section for simplicity, we simulate the system under 
symmetric conditions, apply the different algorithms, and 
compare their performances. We use g(x) = log{\ + x). 
The fading of each channel changes from slot to slot in- 
dependently; {hk(i),k > 1} are iid with values 0.1, 0.6, 
1.8 and 5 with probabilities 4/12, 5/12, 2/12 and 1/12. The 
{Xk(i),k > 1} and {Y k (i),k > 1} are iid expontential. 
The LMS ( fT3l ) was taken with /i = 0.01 and the a^s were 
updated after 30-50 slots. 
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Fig. 7. Orthogonal Channels: Symmetric, 3 Queues 
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Fig. 8. CSMA: Mean Delay, Symmetric 10 Queues 

For orthogonal channels, under symmetric conditions with 
3 queues, average queue lengths are shown in Fig. [7] for 
TO £ED, Greedy (O, TDMA, Greedy with water-filling and 
TDMA with water-filling policies. The {h k (i), k > 1} are iid 
exponential with mean 1 . For symmetric conditions, Greedy 
is throughput optimal and hence Modified Greedy is not 
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Fig. 9. CSMA: Packet Loss Probability, Symmetric 10 Queues 



implemented. We see that TDMA becomes unstable much 
before the other policies, and that its average queue length 
is much worse even when it is stable. Greedy performs better 
than TO near the stability boundary which is E[X] = 0.39. 
Water-filling improves the stability region of TDMA as well 
as Greedy. 

For CSMA, Figs. [8] and [9] show mean delays and 
packet loss probabilities under symmetric conditions with 
10 queues and with normal exponential backoff, Zhao- 
Tong [26], our policy (fTol l and our policy with water-filling 
(with fpoiicy(x) = Ppoiicy/x and Ef = 1.55 at EX = 
0.17; h assumes values 0.1,0.5,1.0,2.2 for time fractions 
0.1,0.3,0.4,0.2). We simulated the 10 queues in continuous 
time. Also, E[Y] = 1 and the data packets of unit size arrive 
at each queue as Poisson streams. We see that opportunistic 
policies improve mean delays substantially. 

XII. Conclusions 

We have considered sensor nodes with energy harvesting 
sources, deployed for random field estimation. Throughput 
optimal and mean delay optimal energy management policies 
for single nodes are identified which can make them work in 
energy neutral operation. Next these results are extended to 
fading channels and when energy at the sensor node is also 
consumed in sensing and data processing. Similarly we can 
include leakage/wastage of energy when it is stored in the 
energy buffer and when it is extracted. Finally these policies 
are used to develop efficient MAC protocols for such nodes. 
In particular versions of TDMA, opportunistic MACs for fad- 
ing channels and CSMA are developed. Their performance 
is compared via simulations. It is shown that opportunistic 
policies can substantially improve the performance. 
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